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Abstract 

We say that a graph with n vertices is c-Ramsey if it does not 
contain either a clique or an independent set of size clogn. We define 
a CNF formula which expresses this property for a graph G. We show 
a superpolynomial lower bound on the length of resolution proofs that 
G is c-Ramsey, for every graph G. Our proof makes use of the fact 
that every Ramsey graph must contain a large subgraph with some of 
the statistical properties of the random graph. 



Introduction 

Graphs with special properties often require non trivial and/or probabilistic 
constructions. Furthermore, once the graph is constructed it may be hard 
to verify that the property holds, and if such a graph is given to a new user 
without a suitable certificate, he must either verify the construction again 
or blindly trust the graph. 

In this paper we are interested in how hard it is to certify that a graph 
G of size n is c-Ramsey, that is, has no clique or independent set of length 
clogn. Constructing such graphs was one of the first applications of the 
probabilistic method in combinatorics. But the brute force approach to 
checking that G satisfies the property takes time n°( logn ) (compare the 
well-known hard problem of looking for cliques). We show that there is no 
resolution proof that G is c-Ramsey with length shorter than n°^ ogn \ This 
is not a worst-case result, but rather holds for every graph G. However we 
are only able to show this for what we call the "binary" formalization of 
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the Ramsey property as a propositional formula; for an alternative, "unary" 
formalization we only know a treelike resolution lower bound (see Section 
1.1). 

Notice that the lower bound on resolution proof size shows that the ver- 
ification problem is hard for quite a large class of algorithms, since most 
SAT solvers used in practice are essentially proof search algorithms for res- 
olution [18]. Notice also that, while it does not follow from the resolution 
lower bound that there is no algorithm which will construct a Ramsey graph 
in polynomial time, it does follow that, given such an algorithm, there is no 
polynomial-size resolution proof that the algorithm works. 

The finite Ramsey theorem states that for any k, there is some N such 
that every graph of size at least N contains a clique or independent set of 
size k. We write r(k) for the least such N. Computing the actual value of 
r(k) is challenging, and so far only a few values have been discovered. For 
this reason there is great interest in asymptotic estimates [12, 22, 10]. 

A c-Ramsey graph is a witness that r(clogn) > n, so proving that a 
graph is Ramsey is in some sense proving a lower bound for r(k). Previously, 
proof complexity has focused on upper bounds for r(k). Krishnamurthy and 
Moll [17] proved partial results on the complexity of proving the exact upper 
bound, and conjectured this formula to be hard in general. Krajfcek later 
proved an exponential lower bound on the length of bounded depth Frege 
proofs of the same statement [16]. The upper bound r(k) < 4 fc has short 
proofs in a relatively weak fragment of sequent calculus, in which every 
formula in a proof has small constant depth [20], [16]. Recently Pudlak [21] 
has shown a lower bound on proofs of r(k) < 4 k in resolution. We discuss 
this in more detail in Section 1. There are also results known about the off- 
diagonal Ramsey numbers r(k,s) where cliques of size k and independent 
sets of size s are considered. See [13, 1, 14, 8] for estimates and [9] for 
resolution lower bounds. 

In Section 1 we formally state our main result, mention some open prob- 
lems, and then outline the high-level method we will use. In Section 2 we 
apply this to prove a simple version of our main theorem, restricted to the 
case when G is a random graph. In Section 3 we prove the full version. This 
will use one extra ingredient, a result from [19] that every Ramsey graph 
G has a large subset with some of the statistical density properties of the 
random graph. 

1 Definitions and results 

Resolution [7] is a system for refuting propositional CNFs, that is, propo- 
sitional formulas in conjunctive normal form. A resolution refutation is a 
sequence of disjunctions, which in this context we call clauses. Resolution 
has a single inference rule: from two clauses AVx and B\/^x we can infer the 
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new clause A V B (which is a logical consequence) . A resolution refutation 
of a CNF is a derivation of the empty clause from the clauses of <f>. For an 
unsatisfiable formula 4> we define L(4>) to be the length, that is, the number 
of clauses, of the shortest resolution refutation of <f>. If 4> is satisfiable we 
consider L((f>) to be infinite. 

Let c > be a constant, whose value will be fixed for the rest of the 
paper. 

Definition 1 (Ramsey graph). We say that a graph with n vertices is 
c-Ramsey if there is no set of clogn vertices which form either a clique 
or an independent set. 

We now describe how we formalize this in a way suitable for the reso- 
lution proof system. Given a graph G on n = 2 k vertices, we will define a 
formula \E'g i n conjunctive normal form which is satisfiable if and only there 
is a homogeneous set of size ck in G, that is, if and only if G is not Ramsey. 
We identify the vertices of G with the binary strings of length k. In this 
way we can use an assignment to k propositional variables to determine a 
vertex. 

The formula has variables to represent an injective mapping from a 
set of ck "indices" to the vertices of G, and asserts that the vertices mapped 
to form either a clique or an independent set. It has a single extra variable 
y to indicate which of these two cases holds. 

In more detail, for each i € [ck] we have k variables x\,...,x\ which we 
think of as naming, in binary, the vertex of G mapped to by i. We have 
an additional variable y, so there are ck 2 + 1 variables in total. To simplify 
notation we will write propositional literals in the form u xl = 1" , u xl ^ 0" , 
"x\ = 0" and u x l b ^ 1". The first and the second are aliases for the literal 
xl. The third and the fourth are aliases for literal ^xl. 

The formula *&g then consists of clauses asserting the following: 

1. The map is injective. For each vertex v € V(G), represented as 
v i • • • v k in binary, and each pair of distinct i,j £ [ck], we have the 
clause 

k k 

\J{xi^v b )v\l{x{^v b ). 

6=1 6=1 
These clauses guarantee that no two indices i and j map to the same 
vertex v. 

2. If y = 0, then the image of the mapping is an independent set. 

For each pair of distinct vertices u, v £ V(G), represented respectively 
as u\ . . . uj~ and v\ . . . v^, and each pair of distinct i,j € [ck], if {u, v} G 
E(G) we have the clause 

k k 

yV\/(xi^u b )v\/(4^v b ). 
y=l 6=1 
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These clauses guarantee that, if y = 0, then no two indices are mapped 
to two vertices with an edge between them. 

3. If y = 1, then the image of the mapping is a clique. For each pair 
of distinct vertices u, v £ V(G), represented respectively as u\ . . . 
and v\ . . . Vk, and each pair of distinct i, j £ [ck], if {u,v} £ E(G) we 
have the clause 

k k 

6=1 6=1 

These clauses guarantee that, if y = 1, then no two indices are mapped 
to two vertices without an edge between them. 

Notice that the formula has ( c ^) (l + Q)) clauses in total, and so is unusual 
in that the number of clauses is exponentially larger than the number of 
variables. However the number of clauses is polynomial in the number n of 
vertices of G. 

If G is Ramsey, then ^>g is unsatisfiable and only has clog 2 n+1 variables. 
So we can refute \E , g m quasipolynomial size by a brute-force search through 
all assignments: 

Proposition 2. IfG is c-Ramsey, the formula \P<3 has a (treelike) resolution 
refutation of size n°^ ogn > . 

At this point, we should recall the formalization of the Ramsey theorem 
that is more usually studied in proof complexity. This is the family RAM n 
of propositional CNFs, where RAM n has one variable for each distinct pair 
of points in [n] and asserts that the graph represented by these variables is 
^-Ramsey. Hence RAM n is satisfiable if and only if any ^-Ramsey graph on 
n vertices exists. In contrast, our formula is satisfiable if and only if our 
particular graph G is not c-Ramsey. 

Put differently, a refutation of RAM n is a proof that r(k) < 2 2k . This 
was recently shown to require exponential size (in n) resolution refutations 
[21]. On the other hand a refutation of ^g is a proof that G is c-Ramsey, 

k 

and hence that G witnesses that r(k) > 2~. 

We now state our main result. We postpone the proof to Section 3. 

Theorem 3. Let G be any graph with n vertices. Then L{^g) — n n ^° sn \ 

If G is not c-Ramsey then this is trivial, since ^/q is satisfiable and there- 
fore L(^?g) is infinite by convention. If G is c-Ramsey, then by Proposition 
2 this bound is tight and we know that L(^/g) = n @ ( logn K 
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1.1 Open problems 

A shortcoming of our result is that our formula *$>g asserting that a graph 
is not c-Ramsey identifies the vertices of G with binary strings. It could 
be argued that this "binary encoding" of the statement brings some extra 
structure to the graph, and that a formalization which does not do this is 
more combinatorially natural. 

So consider the "unary encoding" ^' G , in which the mapping from an 
index i to the vertices of G is represented by n variables {p l v : v £ V(G)} and 
we have clauses asserting that for each i, exactly one of the variables p % v is 
true. Otherwise the structure of ^>' G is similar to that of *$>g- As before, if G 
is a c-Ramsey graph we have the brute-force upper bound L(^f' G ) = n°^° sn \ 
But we are not able to prove a super polynomial lower bound on resolution 
size. However if we restrict to treelike resolution, such a lower bound follows 
using techniques from [5] . Here we are able to prove the tree- like resolution 
lower bound as a corollary of our main theorem (we are grateful to Leszek 
Kolodziejczyk for pointing out this simpler proof). 

Theorem 4. Let G be any c-Ramsey graph with n vertices. Then ty' G re- 
quires treelike resolution refutations of size n^ logn \ 

Proof. (Sketch) Suppose we have a small treelike resolution refutation of the 
unary formula *$>' G . We can produce from it an at most polynomially larger 
treelike Res(fc) refutation of the binary formula as follows. Replace each 
variable p % v asserting that index i is mapped to vertex v with the conjunction 
Ab=i x l = v b- The substitution instance of ^' G is then almost identical to 
the $?Gi except for the additional clauses asserting that every index maps 
to exactly one vertex; but these are easy to derive in treelike Res(fc). 

It is well-known that every treelike depth d + 1 Frege proof can be made 
into a daglike depth al Frege proof with at most polynomial increase in size 
[15]. In particular, we can turn our treelike Res(/c) refutation of into a 
resolution refutation. The lower bound then follows from Theorem 3. □ 

Lower bounds for daglike resolution would have interesting consequences 
for various area of proof complexity [3, 11]. This is related to the following 
open problem (rephrased from [6]): consider a random graph G distributed 

according to G(n, n _( ' 1+e ' )l - T ) for some e > 0. Does every resolution proof 
that there is no /c-clique in G require size n^( fc )? For tree- like resolution this 
problem has been solved in [5]. 

1.2 Resolution width and combinatorial games 

The width of a clause is the number of literals it contains. The width of a 
CNF (ft is the width of its widest clause. Similarly the width of a resolution 
refutation n is the width of its widest clause. The width of refuting an 
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unsatisfiable CNF <j) is the minimum width of II over all refutations II of <p. 
We will denote it by W(4>). 

A remarkable result about resolution is that it is possible to lower bound 
the proof length by lower bounding the proof width. 

Theorem 5 ([4]). For any CNF eft with m variables and width k, 



Now consider a game played between two players, called the Prover and 
the Adversary. The Prover claims that a CNF (ft is unsatifiable and the 
Adversary claims to know a satisfying assignment. At each round of the 
game the Prover asks for the value of some variable and the Adversary has 
to answer. The Prover saves the answer in memory, where each variable 
value occupies one memory location. The Prover can also delete any saved 
value, in order to save memory. If the deleted variable is asked again, the 
Adversary is allowed to answer differently. The Prover wins when the partial 
assignment in memory falsifies a clause of <fi. The Adversary wins if he has 
a strategy to play forever. 

If <j) is in fact unsatisfiable, then the Prover can always eventually win, by 
asking for the total assignment. If <fi is satisfiable, then there is an obvious 
winning strategy for the Adversary (answering according to a fixed satisfying 
assignment). However, even if <j> is unsatisfiable, it may be that the Prover 
cannot win the game unless he uses a large amount of memory. Indeed, it 
turns out that smallest number of memory locations that the Prover needs 
to win the game for an unsatisfiable <j) is related to the width of resolution 
refutations. (We only need one direction of this relationship - for a converse 



Lemma 6. Given an unsatisfiable CNF <f>, it holds that W(<j)) + 1 memory 
locations are sufficient for the Prover in order to win the game against any 
Adversary. 

1.3 The clique formula 

For any graph G, the formula \y=i is satisfiable if and only if G has a 
clique of size ck. We will call this restricted formula Clique(G). Dually, 
vE'g \y=0 is equivalent to Clique(G'). Since fixing a variable in a resolution 
refutation results in a refutation for the corresponding restricted formula, 
we have 



Furthermore we can easily construct a refutation of from refutations of 
^c\y=i and vPgIV=o- I n this way we get 




sec [2].) 



max 



{L(Clique(G)),L(Clique(G))} < L(* G ). 



L(V G ) < L(Clique(G)) + L(Clique(G)) + 1. 
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We can now describe our high-level approach. To lower-bound L(^>g) 
it is enough to lower-bound L(Chque(G)), which we will do indirectly by 
exhibiting a good strategy for the Adversary in the game on Clique(G). This 
game works as follows: the Adversary claims to know ck strings in {0, l} k 
which name ck vertices in G which form a clique. The Prover starts with 
no knowledge of these strings but can query them, one bit at a time, and 
can also forget bits to save memory. The Prover wins if at any point there 
are two fully-specified strings for which the corresponding vertices are not 
connected by an edge in G. 

We will give a strategy for the Adversary which will beat any Prover 
limited to ek 2 memory for a constant e > 0. It follows by Lemma 6 that 
Clique(G) is not refutable in width ek 2 . The formula Clique(G) has ck 2 
variables and has width 2k. Hence applying Theorem 5 we get 

/( £ fc 2 -2fc) 2 \ 

L(* G ) > L(Clique(G)) > 2 V ck ' A J > 2 n ^ > n^ 10 ^ . 
1.4 Other notation 

We will consider simple graphs with n = 2 k vertices. We identify the vertices 
with the binary strings of length k. For any vertex v € G we denote its binary 
representation by v± ■ ■ ■ V)-. 

A pattern is a partial assignment to k variables. Formally, it is a string 
P = Pi ■ ■ ■ Pk G {*)0, l} fc , and we say that p is consistent with v if for all 

1 £ [k] either pi = vi or pi = *. The size \p\ of p is the number of bits set to 
or 1. The empty pattern is a string of k stars. 

For any vertex v S V(G) we let N(v) be the set {« {u, u} E E(G)} 
of neighbours of v. Notice that v E" N(v). For any U C V{G) we let 
N(U) be the set of vertices of G which neighbour every point in U, that is, 
f] veU N(U). Notice that U n N(U) = 0. 

2 Lower bounds for the random graph 

We consider random graphs on n vertices given by the usual distribution 
G(n, \) in the Erdos-Renyi model. 

Theorem 7. If G is a random graph, then with high probability L{^>g) = 

n f2(logn)_ 

We will use the method outlined in Section 1.3 above, so to prove the 
theorem it is enough to give a strategy for the Adversary in the game on 
Clique(G) which forces the Prover to use a large amount of memory. This 
is Lemma 9 below. We first prove a lemma which captures the property of 
the random graph which we need. 
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Lemma 8. For a random graph G, with high probability, the following prop- 
erty P holds. Let U C V(G) with \U\ < ^k and let p be any pattern with 
\p\ < 3k. Then p is consistent with at least one vertex in N(U). 

Proof. Fix such a set U and such a pattern p. The probability that an 
arbitrary vertex v ^ U is in N(U) is at least 2~s k = n~s. The pattern p is 

2 

consistent with at least ns — \U\ vertices outside U. The probability that 
no vertex consistent with p is in N(U) is hence at most 



_> 



nS-\U\ i 



1 — n 3 < e" 
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We can bound the number of such sets U by ns k < n} ogn and the number 
of patterns p by 3 fc < n 2 , so by the union bound property P fails to hold 

with probability at most 2~ n ( nl \ □ 

Lemma 9. Let G be any graph with property P. Then there is an Adversary 
strategy in the game on Clique(G) which wins against any Prover who uses 
at most ^k 2 memory locations. 

Proof. For each index i G [ck] , we will write p 1 for the pattern representing 
the current information in the Prover's memory about the ith. vertex. The 
Adversary's strategy is to answer queries arbitrarily (say with 0) as long as 
the index i being queried has \p l \ < ^k — 1. If |jp*| = ^k — 1, the Adversary 
privately fixes the ith vertex to be some particular vertex v % of G consistent 
with p l , and then answers queries to i according to v % until, through the 
Prover forgetting bits, \p % \ falls below \k again, at which point the Adversary 
considers the ith vertex no longer to be fixed. 

If the Adversary is able to guarantee that the set of currently fixed 
vertices always forms a clique, then the Prover can never win. So suppose 
we are at a point in the game where the Adversary has to fix a vertex for 
index i, that is, where the Prover is querying a bit for i and = ^k — 1. 
Let U C V(G) be the set of vertices that the Adversary currently has fixed. 
It is enough to show that there is some vertex consistent with p 1 which is 
connected by an edge in G to every vertex in U. But by the limitation on 
the size of the Prover's memory, no more than ^k vertices can be fixed at 
any one time. Hence \U\ < 3 k and the existence of such a vertex follows 
from property P. □ 



3 Lower bounds for Ramsey graphs 

We prove Theorem 3, that for any c-Ramsey graph G on n vertices, L{^>g) ^ 
n fj(iogn)_ As in the previous section we will do this by showing, in Lemma 13 
below, that the Adversary has a strategy for the game on Clique(G) which 
forces the Prover to use a lot of memory. 
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Definition 10. Given sets A,B C V(G) we define their mutual density by 

where we write e{A, B) for the number of edges in G with one end in A 
and the other in B. For a single vertex v we will write d(v,B) instead of 
d({v},B). 

Our main tool in our analysis of Ramsey graphs is the statistical property 
shown in Corollary 12 below, which plays a role analogous to that played 
by Lemma 8 for random graphs. We use the following result proved in [19, 
Case II of Theorem 1]: 

Lemma 11 ([19]). There exists constants j3 > 0, 5 > such that if G is a 

3 

c-Ramsey graph, then there is a set S C V(G) with \S\ > n* such that, for 
all A,B C S, if \A\, \B\ > |5| 1- ^ then 5 < d{A, B) < 1 - 6. 

Now fix a c-Ramsey graph G. Let S, /3 and 5 be as in the above lemma, 
and let m = \S\. Notice that since our goal is to give an Adversary strategy 
for the formula Clique(G), we will only use the lower bound 5 < d{A,B) 
from the lemma. 

Corollary 12. Let X, Y u Y 2 , . . . , Y r C 5 be such that \X\ > rm 1 " 13 and 
1^1 1) ■ • ■ i \Y r \ > m l ~P . Then there exists v € X such that d(v,Yi) > 5 for 
each i = 1, . . . , r. 

Proof. For i = 1, . . . , r let 

Xi = {u G X | d(u,Yi) < 5}. 

By Lemma 11, each \Xi\ < m 1- ^. Hence X \ [J i Xi is non-empty and we 
can take v to be any vertex in X \ [J i X^. □ 

The next lemma implies our main result, Theorem 3. 

Lemma 13. There is a constant e > 0, independent of n and G, such that 
there exists a strategy for the Adversary in the game on Clique(G) which 
wins against any Prover who is limited to e 2 k 2 memory locations. 

Proof. Let e > be a constant, whose precise value we will fix later. As in 
the proof of Lemma 9, the Adversary's replies when queried about the ith 
vertex will depend on the size of p 1 , the pattern representing the current 
information known to the Prover about the ith vertex. If \p l \ < ek — 1 
the Adversary can reply in a somewhat arbitrary way (see below), but if 
\p l \ = ek — 1 then the Adversary will fix a value v l for the ith vertex, 
consistent with p 1 , and will reply according to v % until \p l \ falls back below 
ek, at which point the vertex is no longer fixed. By the limitation on the 
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Prover's memory, no more than ek vertices can be fixed simultaneously, 
which will allow the Adversary to ensure that the set of currently fixed 
vertices always forms a clique. 

Let S, f3 and 5 be as in Lemma 11 and let m = \S\. We will need to 
use Corollary 12 above to make sure that the Adversary can find a v l with 
suitable density properties when fixing the ith vertex. But here there is a 
difficulty which does not arise with the random graph. Corollary 12 only 
works for subsets of the set S, and S may be distributed very non-uniformly 
over the vertices of G. In particular, through some sequence of querying and 
forgetting bits for i, the Prover may be able to force the Adversary into a 
position where the set of vertices consistent with a small p % has only a very 
small intersection with S, so that it is impossible to apply Corollary 12. 

Let a be a constant with < a < (3, whose precise value we will fix 
later. We write C p for the set of vertices of G consistent with a pattern p. 
We write P e k for the set of patterns p with p < ek. To avoid the problem in 
the previous paragraph, we will construct a non-empty set S* C S with the 
property that, for every p £ P^, either 

C p nS* = or \c p ns*\ > m x - a . 

In the second case we will call the pattern p active. The Adversary can then 
focus on the set S* , in the sense that he will pretend that his clique is in S* 
and will ignore the vertices outside S*. 

We construct S* in a brute-force way. We start with Sq = S and define 
a sequence of subsets So, Si, . . . where each St+i = St\C p for the lexico- 
graphically first p € P ek for which < \St H C p \ < m 1_a , if any such p 
exists. We stop as soon as there is no such p, and let S* be the final subset 
in the sequence. To show that S* is non-empty, notice that at each step 
at most m l ~ a elements are removed. Furthermore there are at most \P e k\ 
steps, since a set of vertices C p may be removed at most once. Recall that 
n = 2 k and m > n*. We have 

\P ek \ = fjl l Q < ek ■ T k < ek • 

where H{x) is the binary entropy function —x log x — (1 — x) log(l — x), and 
we are using the estimate (*L) < 2 kH ^ which holds for < e < 1. Then 

\S*\ > \S\ - \P ek \ ■ m 1 '" > nf - ek ■ n^^n^ 1 " ), 

so, for large n, 5* is non-empty as long as we choose a and e satisfying 

la>e + H{e). (*) 

Notice that if S* is non-empty then in fact \S*\ > m 1 "", since S* must 
intersect at least the set C p where p is the empty pattern. 
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We can now give the details of the Adversary's strategy. The Adversary 
maintains the following three conditions, which in particular guarantee that 
the Prover will never win. 

1. For each index i, if \p l \ < ek then p % is active, that is, C p % n S* ^ 0. 

2. For each index i, if \p l \ > ek then the ith vertex is fixed to some 
v l £ C p i n S*; furthermore the set U of currently fixed vertices v J 
forms a clique. 

3. For every active p S P t k and every U' C U, we have 

\c p ns* nN(u')\ > \c p ns* \ -5 lu ' 1 . 

These are true at the start of the game, because no vertices are fixed and 
each p 1 is the empty pattern. 

Suppose that, at a turn in the game, the Prover queries a bit for an 
index i for which he currently has information p 1 . If \p l \ < ek — 1, then by 
condition 1 there is at least one vertex v in C p i n S* . The Adversary chooses 
an arbitrary such v and replies according to the bit of v. If \p l \ > ek, then a 
vertex v l G Cy is already fixed, and the Adversary replies according to the 
bit of v l . 

If \p l \ = ek — 1, then the Adversary must fix a vertex v % for i in a way that 
satisfies conditions 2 and 3. To preserve condition 2, v % must be connected to 
every vertex in the set U of currently fixed vertices. To preserve condition 3, 
it is enough to choose v l such that 

d{v\c p ns* nN(u')) > \c p ns* nN(u')\-5 

for every active p in P e /% and every U' C U. To find such a v l we will apply 
Corollary 12, with one set Y for each pair of a suitable p and U'. We put 

x = c pi n iv(c/) n s* 

Y{j>,u') = C p C\ N(U') Pl 5* for each active p G P t k and each U' Q U 
r= |{pairs (p,l7')}| < |P ei; | • 2™. 

We know |C/| < ek. By condition 1 we know p l is active, hence \C V % <T\S*\> 
m 1 " . So by condition 3 we have 

\X\ > m 1 ""^ = m 1 - a+ t el °s <5 . 

For similar reasons we have the same lower bound on the size of each YL j/a. 
Furthermore 

r<T k -ek- n* +H ^ = ek ■ n 2 * +H ^ = ek ■ m^ + l H ^. 
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To apply Corollary 12 we need to satisfy \X\ > rm 
Both conditions are implied by the inequality 



and \Y ( 



(p,U') 



> m 



1-/3 



p-a> §e+ §#(e) - §elog& 



(t) 



We can now fix values for the constants a and e to satisfy the inequalities 
(*) and (f). Since H(e) goes to zero as e goes to zero, we can make the right 
hand sides of (*) and (f ) arbitrary small by setting e to be a small constant. 
We then set a appropriately. 

Finally, it is straightforward to check that if the Prover forgets a bit for 
an index i, then the three conditions are preserved. □ 
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